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RESULT 9 

US-09-991-181 -84 

Sequence 84, Application US/09991181 
Patent No. 6913 919 
GENERAL INFORMATION: 
APPLICANT: Ashkena zi , Avi J. 
APPLICANT: Baker, Kevin P. 
APPLICANT: Bots tein , David 
APPLICANT: De snoyer s , Luc 
APPLICANT: Eaton, Dan L. 
APPLICANT: Ferrar a , Napoleone 
APPLICANT: Fong , Sherman 
APPLICANT : Gerber , Hanspet er 
APPLICANT: Gerritsen, Mary E. 
APPLICANT: Goddard, Audrey 
APPLICANT: Godowski , Paul J. 
APPLICANT: Gr imaldi , J . Chr is topher 
APPLICANT: Gurney , Aus tin L. 
APPLICANT: Kljavin,Ivar J. 
APPLICANT: Napier, Mary A. 
APPLICANT: Pan, James 
APPLICANT: Paoni , Ni cholas F. 
APPLICANT: Roy, Margaret Ann 
APPLICANT: Stewart, Timothy A. 
APPLICANT: Tumas, Daniel 
APPLICANT: Watanabe, Colin K. 
APPLICANT: Williams, P. Mickey 
APPLICANT: Wood, William I. 
APPLICANT: Zhang, Zemin 

TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 
TITLE OF INVENTION: Acids Encoding the Same ' 
FILE REFERENCE: P2 73 0P1C53 

CURRENT APPLICATION NUMBER: US/0 9/9 91,181 
CURRENT FILING DATE: 2001-11-16 
PRIOR APPLICATION NUMBER: 60/049787 
PRIOR FILING DATE: 1997-06-16 
PRIOR APPLICATION NUMBER: 60/062250 
PRIOR FILING DATE: 1997-10-17 
PRIOR APPLICATION NUMBER: 60/065186 
PRIOR FILING DATE: 1997-11-12 
PRIOR APPLICATION NUMBER: 60/065311 
PRIOR FILING DATE: 1997-11-13 
PRIOR APPLICATION NUMBER: 60/066770 
PRIOR FILING DATE: 1997-11-24 
PRIOR APPLICATION NUMBER: 60/075945 
PRIOR FILING DATE: 1998-02-25 
PRIOR APPLICATION NUMBER: 60/078910 
PRIOR FILING DATE: 1998-03-20 
PRIOR APPLICATION NUMBER: 60/083322 
PRIOR FILING DATE: 1998-04-28 
PRIOR APPLICATION NUMBER: 60/084 600 
PRIOR FILING DATE: 1998-05-07 
PRIOR APPLICATION NUMBER: 60/087106 
PRIOR FILING DATE: 1998-05-28 
PRIOR APPLICATION NUMBER: 60/087607 
PRIOR FILING DATE: 1998-06-02 
PRIOR APPLICATION NUMBER: 60/087609 
PRIOR FILING DATE: 1998-06-02 
PRIOR APPLICATION NUMBER: 60/087759 
PRIOR FILING DATE: 1998-06-02 
PRIOR APPLICATION NUMBER: 60/087827 
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PRIOR FILING DATE : 1998-06-03 

PRIOR APPLICATION NUMBER: 60/088021 

PRIOR FILING DATE: 1998-06-04 

PRIOR APPLICATION NUMBER: 60/088025 

PRIOR FILING DATE: 1998-06-04 

PRIOR APPLICATION NUMBER: 60/088026- 

PRIOR FILING DATE: 1998-06-04 

PRIOR APPLICATION NUMBER: 60/088028 

PRIOR FILING DATE: 1998-06-04 

PRIOR APPLICATION NUMBER: 60/088 029 

PRIOR FILING DATE: 1998-06-04 

PRIOR APPLICATION NUMBER: 60/088 030 

PRIOR FILING DATE: 1998-06-04 

PRIOR APPLICATION NUMBER: 60/0 88 033 

PRIOR FILING DATE: 1998-06-04 

PRIOR APPLICATION NUMBER: 60/088326 

PRIOR FILING DATE: 1998-06-04 

PRIOR APPLICATION NUMBER: 60/0 88167 

PRIOR FILING DATE: 1998-06-05 

PRIOR APPLICATION NUMBER: 60/088202 

PRIOR FILING DATE: 1998-06-05 

PRIOR APPLICATION NUMBER: 60/088212 

PRIOR FILING DATE: 1998-06-05 

PRIOR APPLICATION NUMBER: 60/088217 

PRIOR FILING DATE: 1998-06-05 

PRIOR APPLICATION NUMBER: 60/088655 

PRIOR FILING DATE: 1998-06-09 

PRIOR APPLICATION NUMBER: 60/088734 

PRIOR FILING DATE: 1998-06-10 

PRIOR APPLICATION NUMBER: 60/088738 

PRIOR FILING DATE: 1998-06-10 

PRIOR APPLICATION NUMBER: 60/088742 

PRIOR FILING DATE: 1998-06-10 

PRIOR APPLICATION NUMBER: 60/0 88 810 

PRIOR FILING DATE: 1998-06-10 

PRIOR APPLICATION NUMBER: 60/088824 

PRIOR FILING DATE: 1998-06-10 

PRIOR APPLICATION NUMBER: 60/0 88 826 

PRIOR FILING DATE: 1998-06-10 

PRIOR APPLICATION NUMBER: 60/088858 

PRIOR FILING DATE: 1998-06-11 

PRIOR APPLICATION NUMBER: 60/088861 

PRIOR FILING DATE: 1998-06-11 

PRIOR APPLICATION NUMBER: 60/088876 

PRIOR FILING DATE: 1998-06-11 

PRIOR APPLICATION NUMBER: 60/089105 

PRIOR FILING DATE: 1998-06-12 

PRIOR APPLICATION NUMBER: 60/089440 

PRIOR FILING DATE: 1998-06-16 

PRIOR APPLICATION NUMBER: 60/089512 

PRIOR FILING DATE: 1998-06-16 

PRIOR APPLICATION NUMBER: 60/089514 

PRIOR FILING DATE: 1998-06-16 

PRIOR APPLICATION NUMBER: 60/089532 

PRIOR FILING DATE: 1998-06-17 

PRIOR APPLICATION NUMBER: 60/089538 

PRIOR FILING DATE: 1998-06-17 

PRIOR APPLICATION NUMBER: 60/089598 

PRIOR FILING DATE: 1998-06-17 

PRIOR APPLICATION NUMBER: 60/089599 

PRIOR FILING DATE: 1998-06-17 
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PRIOR APPLICATION NUMBER: 60/089600 
PRIOR FILING DATE: 1998-06-17 
PRIOR APPLICATION NUMBER: 60/089653 
PRIOR FILING DATE: 1998-06-17 
PRIOR APPLICATION NUMBER: 60/089801 
PRIOR FILING DATE: 1998-06-18 
PRIOR APPLICATION NUMBER: 60/089907 
PRIOR FILING DATE: 1998-06-18 
PRIOR APPLICATION NUMBER: 60/0 89 908 
PRIOR FILING DATE: 1998-06-18 
PRIOR APPLICATION NUMBER: 60/089947 
PRIOR FILING DATE: 1998-06-19 
PRIOR APPLICATION NUMBER: 60/0 89948 
PRIOR FILING DATE: 1998-06-19 
PRIOR APPLICATION NUMBER: 60/0 89 952 
PRIOR FILING DATE: 1998-06-19 
PRIOR APPLICATION NUMBER: 60/090246 
PRIOR FILING DATE: 1998-06-22 
PRIOR APPLICATION NUMBER: 60/090252 
PRIOR FILING DATE: 1998-06-22 
PRIOR APPLICATION NUMBER: 60/090254 
PRIOR FILING DATE: 1998-06-22 
PRIOR APPLICATION NUMBER: 60/090349 
PRIOR FILING DATE: 1998-06-23 
PRIOR APPLICATION NUMBER: 60/090355 
PRIOR FILING DATE: 1998-06-23 
PRIOR APPLICATION NUMBER: 60/0 90429 
PRIOR FILING DATE: 1998-06-24 
PRIOR APPLICATION NUMBER: 60/090431 
PRIOR FILING DATE: 1998-06-24 
PRIOR APPLICATION NUMBER: 60/090435 
PRIOR FILING DATE: 1998-06-24 
PRIOR APPLICATION NUMBER: 60/090444 
PRIOR FILING DATE: 1998-06-24 
PRIOR APPLICATION NUMBER: 60/090445 
PRIOR FILING DATE: 1998-06-24 
PRIOR APPLICATION NUMBER: 60/090472 
PRIOR FILING DATE: 1998-06-24 
PRIOR APPLICATION NUMBER: 60/090535 
PRIOR FILING DATE: 1998-06-24 
PRIOR APPLICATION NUMBER: 60/090540 
PRIOR FILING DATE: 1998-06-24 
PRIOR APPLICATION NUMBER: 60/090542 
PRIOR FILING DATE: 1998-06-24 
PRIOR APPLICATION NUMBER: 60/090557 
PRIOR FILING DATE: 1998-06-24 
PRIOR APPLICATION NUMBER: 60/090676 
PRIOR FILING DATE: • 1998-06-25 
PRIOR APPLICATION NUMBER: 60/090678 
PRIOR FILING DATE: 1998-06-25 
PRIOR APPLICATION NUMBER: 60/090690 
PRIOR FILING DATE: 1998-06-25 
PRIOR APPLICATION NUMBER: 60/090694 
PRIOR FILING DATE: 1998-06-25 
PRIOR APPLICATION NUMBER: 60/0 90 695 
PRIOR FILING DATE: 1998-06-25 
PRIOR APPLICATION NUMBER: 60/090696 
PRIOR FILING DATE: 1998-06-25 
PRIOR APPLICATION NUMBER: 60/090862 
PRIOR FILING DATE: 1998-06-26 
PRIOR APPLICATION NUMBER: 60/0 90 863 
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PRIOR FILING DATE: 1998-06-26 

PRIOR APPLICATION NUMBER: 60/091360 

PRIOR FILING DATE: 1998-07-01 

PRIOR APPLICATION NUMBER: 60/091478 

PRIOR FILING DATE: 1998-07-02 

PRIOR APPLICATION NUMBER: 60/091544 

PRIOR FILING DATE: 1998-07-01 

PRIOR APPLICATION NUMBER: 60/0 91519 

PRIOR FILING DATE: 1998-07-02 

PRIOR APPLICATION NUMBER: 60/0 91626 

PRIOR FILING DATE: 1998-07-02 

PRIOR APPLICATION NUMBER: 60/0 91633 

PRIOR FILING DATE: 1998-07-02 

PRIOR APPLICATION NUMBER: 60/091978 

PRIOR FILING DATE: 1998-07-07 

PRIOR APPLICATION NUMBER: 60/091982 

PRIOR FILING DATE: 1998-07-07 

PRIOR APPLICATION NUMBER: 60/0 92182 

PRIOR FILING DATE: 1998-07-09 

Query Match 66.8%; Score 3160; DB 2; Length 867 ; 

Best Local Similarity 64.8%; Pred . No. 2.2e-303; 

Matches 577; Conservative 120; Mismatches 140; Indels 54; Gaps 13; 

Qy 7 ALVLAVLGT ELLGSLCSTVRSPRFRGRIQQERKNIRPNII LVLTDDQDVELGSLQVM 63 

:||| =| Ml : : I = I I I :: I = I II I I I I I I I I I I I I I I I I h I I I 

Db 5 SLVLCLLSATVFSLLGGSSAFLSHHRLKGRFQRDRRNIRPNI ILVLTDDQDVELGSMQVM 64 

Qy • 64 NKTRKIMEHGGATFINAFVTTPMCCPSRSSMLTGKYVHNHNVYTNNENCSSPSWQAMHEP 12 3 

I II hi I I III II II II Mill II II I hi II II II II I I Ml II II MM II II 

Db 65 N KT RR I M EQ GG AH F I NAFVTT PMCCPSRSSILTGKYVHNHNTYTNNENCSSPSWQAQHES 124 

Qy 124 RTFAVYLNNTGYRTAFFGKYLNEY NGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKEKHG 183 

IIIMMhllMMIIMMMIMIhlllhlhlhllllllllhlllhlllll 

Db 125 RTFAVYLNSTGYRTAFFGKYLNEYNGSYVP PGWKEWVGLLKNSRFYNYTLCRNGVKEKHG 18 4 

Qy 184 FDYAKDYFTDLITNESINYFKMSKRMYPHRPVMMVI SHAAPHGPEDSAPQFSKLYPNASQ 24 3 

I ! : I I I |||||:|:::|: | h I I I I I I h I I I M I I I I I I II II I h h h I M I I 
Db 185 SDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVI SHAAPHGPEDSAPQYSRLFPNASQ 244 

Qy 244 HITPSYNYAPNMDKHWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVET 303 

I II MM II II I II II h Mill I II II II h II INI II 1 1 II II hi MM II II 

Db 245 HITPS YNYAPN PDKHWIMRYTGPMKPIHMEFTNMLQRKRLQT LMSVDDSMETIYNMLVET 304 

Qy 304 GELENTYII YTADHGYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGSIVPQIVLNID 363 

llhlllhlllllllllllllMllllllhllllllhMlhM I : I MUM 

Db 3 05 GELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGCLN PHIVLNID 364 

Qy 3 64 LAPTILDIAGLDTPPDVDGKSVLKLLDPEKPGNRFRTNKKAKIWRDTFLVERGKFLRKKE 42 3 

I II MM II II I I h lllhll II I hi III II - I I I : I I I I I I I I h: 
Db 3 65 LAPTILDIAGLDI PADMDGKSILKLLDTERPVNRFHLKKKMRVWRDSFLVERGKLLHKRD 424 

Qy 4 24 ESSKNIQQSNHLPKYERVKELCQQARYQTACEQPGQKWQCIEDTSGKLRIHKCKGPSDLL 4 83 

: h I lllhllhllhl II II II I II lllhll = II I = = 1 1 1 M I I 

Db 4 25 NDKVDAQEENFLPKYQRVKDLCQRAEYQTACEQLGQKWQCVEDATGKLKLHKCKGPMRLG 48 4 

Qy 4 84 TVRQSTRNLYARGFHDKDKECSCRESGYRASRSQRKSQRQFLRNQGTPKYKPRFVHTRQT 54 3 

I : II : : : hi h I = h =: I = III MM 

Db 4 85 GSR-ALSNLVPKYYGQGSEACTCDSGDYKLSLAGRR-KKLFKK KYKASYVRSRSI 53 7 

Qy 544 RSLSVEFEGEI YDINLEEEEELQV LQPRNI AKRHDEGH KG PRDLQASSGGNRGRMLADSS 603 
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||:::| :| :| : | : Mlh Ml I I I I I II 
Db 538 RSVAI EVDGRVYHVGLGD AAQPRNLTKRH WPG- -APEDQDDKDGG DFS 583 

Qy 604 NAVGPP TTVRVTH KCFILPNDSIHCERELYQSARAWKDHKAYIDKEIEALQDKIK 658 

II ; M I I M M I I M : I : M I M M I M I I Ml Ml M M 1 1 

Db 5 84 GTGGLPDYSAANPIKVTHRCYILENDTVQCDLDLYKSLQAWKDHKLHIDHEIETLQNKIK 64 3 

Qy 659. NLREVRGHLKRRKPEECSCSKQSY YNKE KGVKKQEKLKSHLHPFKEAAQEVDSKLQLFKE 718 

I II -IN I I II- II I I Mlh: Mlh : | 

Db 644 NLREVRGHLKKKRPEECDCHKISYHTQHKGRLKHR- - GSSLHPFRKGLQEKD-KVWLLRE 700 

Qy 719 NNRRRKKERKEKRRQRKGEECSLPGLTCFTHDNNHWQTAPFWNLGSFCACTSSNNNTYWC 778 

MM I II M : : I hi II II MM I II II II Ml I II MM MM II II 

Db 701 -QKRKKKLRKLLKRLQNNDTCSMPGLTCFTHDNQHWQTAPFWTLGPFCACTSANNNTYWC 75 9 

Qy 779 LRTVNETHNFLFCEFATGFLEYFDMNTDPYQLTNTVHTVERGILNQ LHVQLMELRSCQGY 83 8 

M I M 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 I h h M M 1 1 1 1 1 1 II 1 1 1 1 h 1 1 ■ 

Db 7 60 MRTINETHNFLFCEFATGFLEYFD LNTDPYQLMNAVNT LDRDVLNQLHVQLMELRSCKGY 819 
Qy 839 KQCNPRPKNLDVGNKDGGSYDLHR GQLWDGWEG 871 

I MM I MM: MMh M II II Mill 

Db 820 KQCNPRTRNMDL DGGSYEQYRQFQRRKWPEMKRPSSKSLGQLWEGWEG 867 
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RESULT 5 

US-09-668 -673B-16 

Sequence 16, Application US/09668673B 
Patent No. 6562956 
GENERAL INFORMATION: 
APPLICANT: Emerson, Charles P 
APPLICANT: Dhoot, Gurtej K 



TITLE OF INVENTION 
TITLE OF INVENTION 
TITLE OF INVENTION 



IDENTIFICATION AND CLONING OF A NEW SUBFAMILY OF 
SULFATASES AND FUNCTIONAL EMBRYONIC TECHNIQUES FOR 
CHARACTERIZATION OF SUCH PROTEINS 
FILE REFERENCE: PENN-073 3 

CURRENT APPLICATION NUMBER: US/0 9/6 68 , 6 73B 
CURRENT FILING DATE: 2000-09-22 
PRIOR APPLICATION NUMBER: 60/155,73 8 
PRIOR FILING DATE: 1999-09-23 
NUMBER OF SEQ ID NOS : 22 
SOFTWARE: Patentln Ver . 2.0 
SEQ ID NO 16 

LENGTH: 1611 

TYPE: PRT 

ORGANISM: Homo sapiens 
FEATURE : 

NAME/ KEY: UNSURE 
LOCATION: ( 819) 
OTHER INFORMATION 
FEATURE : 

NAME/ KEY: UNSURE 
LOCATION: (840) 
OTHER INFORMATION 
FEATURE : 

NAME/ KEY: UNSURE 
LOCATION: (844) 
OTHER INFORMATION 
FEATURE : 

NAME/ KEY: UNSURE 
LOCATION: (852) 
OTHER INFORMATION 
FEATURE : 

NAME/ KEY: UNSURE 
LOCATION: (858) 
OTHER INFORMATION 
FEATURE : 

NAME/KEY: UNSURE 
LOCATION: (865) 
OTHER INFORMATION 
FEATURE : 

NAME/ KEY: UNSURE 
LOCATION: (875) 
OTHER INFORMATION 
FEATURE : 

NAME/ KEY: UNSURE 
LOCATION: (878) 
OTHER INFORMATION 
FEATURE- 
NAME/ KEY: UNSURE 
LOCATION: (881) 
OTHER INFORMATION 
FEATURE : 

NAME/ KEY: UNSURE 
LOCATION: (888) 

OTHER INFORMATION: any amino acid 



any amino acid 



any amino acid 



any amino acid 



any amino acid 



any amino acid 



any amino acid 



any amino acid 



any amino acid 



any amino acid 
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FEATURE : 

NAME/KEY: UNSURE 
LOCATION: (896) 
OTHER INFORMATION: 
FEATURE : 

NAME/ KEY : UNSURE 
LOCATION: (907) 
OTHER INFORMATION: 
FEATURE : 

NAME/ KEY : UNSURE 
LOCATION: (910) 
OTHER INFORMATION: 
FEATURE : 

NAME/ KEY: . UNSURE 
LOCATION: (915) 
OTHER INFORMATION: 
FEATURE : 

NAME/KEY: UNSURE 
LOCATION: (927) 
OTHER INFORMATION: 
FEATURE : 

NAME/KEY: UNSURE 
LOCATION: (943) 
OTHER INFORMATION: 
FEATURE : 

NAME/KEY: UNSURE 
LOCATION: (945) 
OTHER INFORMATION: 
FEATURE : 

NAME/KEY: UNSURE 
LOCATION: (948) 
OTHER INFORMATION: 
FEATURE : 

NAME/KEY: UNSURE 
LOCATION: (954) 
OTHER INFORMATION: 
FEATURE : 

NAME/KEY: UNSURE 
LOCATION: (959) 
OTHER INFORMATION: 
FEATURE : 

NAME/KEY: UNSURE 
LOCATION: (971) 
OTHER INFORMATION: 
FEATURE : 

NAME/KEY: UNSURE 
LOCATION: (974) 
OTHER INFORMATION: 
FEATURE : 

NAME/KEY: UNSURE 
LOCATION: (1018) 
OTHER INFORMATION: 
FEATURE : 

NAME/ KEY: UNSURE 
LOCATION: (1046) 
OTHER INFORMATION: 
FEATURE : 

NAME/KEY: UNSURE 
LOCATION: (10 80) 
OTHER INFORMATION: 
FEATURE : 



any amino acid 



any amino acid 



any amino acid 



any amino acid 



any amino acid 



any amino acid 



any amino acid 



any amino acid 



any amino acid 



any amino acid 



any amino acid 



any amino acid 



any amino acid 



any amino acid 



any amino acid 
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NAME/ KEY: UNSURE 
LOCATION: (10 89) 
OTHER INFORMATION: 
FEATURE : 

NAME/ KEY : UNSURE 
LOCATION: (1102) . . 
OTHER INFORMATION: 
FEATURE : 

NAME/ KEY: UNSURE 
LOCATION: (1105) 
OTHER INFORMATION: 
FEATURE : 

NAME/ KEY: UNSURE 
LOCATION: (1121) 
OTHER INFORMATION: 
FEATURE : 

NAME/ KEY: UNSURE 
LOCATION: (1127) 
OTHER INFORMATION: 
FEATURE : 

NAME/ KEY: UNSURE 
LOCATION: (1191) 
OTHER INFORMATION: 
FEATURE : 

NAME/ KEY: UNSURE 
LOCATION: (1199) 
OTHER INFORMATION: 
FEATURE : 

NAME/ KEY: UNSURE 
LOCATION: (12 23) 
OTHER INFORMATION: 
FEATURE : 

NAME/ KEY: UNSURE 
LOCATION: (12 35) 
OTHER INFORMATION: 
FEATURE : 

NAME/KEY: UNSURE 
LOCATION: (12 50) 
OTHER INFORMATION: 
FEATURE : 

NAME/ KEY: UNSURE 
LOCATION: (13 07) 
OTHER INFORMATION: 
FEATURE : 

NAME/ KEY: UNSURE 
LOCATION: (13 21) 
OTHER INFORMATION: 
FEATURE : 

NAME/KEY: UNSURE 
LOCATION: (13 56) 
OTHER INFORMATION: 
FEATURE : 

NAME/ KEY: UNSURE 
LOCATION: (13 62) 
OTHER INFORMATION: 
FEATURE : 

NAME/KEY: UNSURE 
LOCATION: (13 82 ) . . 
OTHER INFORMATION: 
FEATURE : 

NAME/ KEY: UNSURE 



any amino acid 



(1103) 

any amino acid 



any amino acid 



any amino acid 



any amino acid 



any amino acid 



any amino acid 



any amino acid 



any amino acid 



any amino acid 



any amino acid 



any amino acid 



any amino acid 



any amino acid 



(1383) 

any amino acid 
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LOCATION: (13 97) 

OTHER INFORMATION: any amino acid 
FEATURE : 

NAME/KEY: UNSURE 
LOCATION: (1431) 

OTHER INFORMATION: any amino acid 
FEATURE : 

NAME/KEY: UNSURE 
LOCATION: (14 37) 

OTHER INFORMATION: any amino acid 
FEATURE : 

NAME/ KEY: UNSURE 
LOCATION: (14 48) 

OTHER INFORMATION: any amino acid 
FEATURE : 

NAME/KEY: UNSURE 
LOCATION: (14 58) 

OTHER INFORMATION: any amino acid 
FEATURE : 

Query Match 94.3%; Score 4460; DB 2; Length 1611; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 818; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 54 DVELGSLQVMNKTRKIMEHGGATFINAFVTTPMCCPSRSSMLTGKYVHNHNVYTNNENCS 113 

I II II II II II MM II lllllll llllllllll II MM MM MM II IIIIIIM II 

Db 1 DVELGSLQVMNKTRKIMEHGGATFINAFVTTPMCCPSRSSMLTGKYVHNHNVYTNNENCS 60 

Qy 114 SPSWQAMHEPRTFAVYLNNTGYRTAFFGKY LNEYNGSYIPPGWREWLGLI KNSRFYNYTV 173 

I ! I i I 1 1 [ I [ I 1 1 1 1 1 1 I ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 61 SPSWQAMHEPRTFAVYLNNTGYRTAF FGKY LNEYNGSYIPPGWREWLGLI KNSRFYNYTV 12 0 

Qy 174 CRNGIKEKHGFDYAKDYFTDLITNESINYFKMSKRMYPHR PVMMVI SHAAPHGPED SAPQ 23 3 

I II II II II II II II M II M III M M II II II M II II II I! M M M MM M II M 

Db 121 C RNGI KE KH GFDYAKDYFTDLITNES IN YFKMSKRMYPHR PVMMVI SHAAPHGPED SAPQ 18 0 

Qy 234 FSKLYPNAS QHITPS YNYAPNMDKHWIMQY TG PMLPIHMEFTNILQRKRLQTLMSVDDSV 293 

1 1 1 1 1 1 1 1 i i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 i 1 1 1 1 1 1 f 1 1 1 1 1 1 1 

Db 181 FSKLY PNASQHITPS YNYAPNMDKHWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSV 24 0 

Qy 294 ERL YNMLVETGELENTY 1 1 YTADH GYHI GQ FGLVKGKSMP YD FD IRVPFF IRGPSVEPGS 353 

III MM II II II II II MMIM MM MM II llllll MM II M II MM MM II 

Db 241 ERLYNMLVETGELENTYII YTADH GYHIGQ FGLVKGKSMPYDFD IRVPFF IRGPSVEPGS 300 

Qy 3 54 I VPQI VLNI DLAPTI LD IAGLDTP PD VDGKS VLKLLDP EK PGNR FRTNKKAK I WRJDTF LV 413 

I II II II II II II II M II M III II M II II II M II II II II M II M II II M M II 

Db 3 01 I VPQI VLNI DLAPTI LD IAGLDTP PDVDGKSVLKLLDPEKPGNRFRTNKKAK I WRDTFLV 36 0 

Qy 414 ERGKFLRKKEESSKNIQQSNHLPKYERVKELCQQARYQTACEQPGQKWQCIEDTSGKLRI 473 

I II II II M II II II II M II IMM II M II II II II M M M II M II M M II II II 

Db 3 61 ERGKFLRKKEESSKNIQQSNHLPKYERVKELCQQARYQTACEQPGQKWQCIEDTSGKLRI 420 

Qy 4 74 HKCKGPSDLLTVRQSTRNLYARGFHDKDKECSCRESGYRASRSQRKSQRQFLRNQGTPKY 53 3 

I II II II II II II II II II II III MM II II II II II II II II II II II M M II II II 

Db 4 21 HKCKGPSDLLTVRQSTRNLYARGFHDKDKECSCRESGYRASRSQRKSQRQFLRNQGTPKY 48 0 

Qy 534 KPRFVHTRQTRSLSVEFEGEI YDINLEEEEELQVLQPRNIAKRHDEGHKGPRDLQASSGG 593 

I II II II II II II II II II II III II II II II II M II II II M I! II II II II M II !l 

Db 4 81 KPRFVHTRQTRSLSVEFEGEI YDINLEEEEELQVLQPRNIAKRHDEGHKGPRDLQASSGG 54 0 

Qy 5 94 NRGRMLADSSNAVGPPTTVRVTHKCFILPNDSIHCERELYQSARAWKDHKAYIDKEIEAL 653 
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Db 


541 


Qy 


654 


Db 


601 


Qy 


714 


Db 


661 


Ov 


774 


Db 


721 


Qy 


834 


Db 


781 



1 1 1 1 F [ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I F I 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 I 



QDKIKNLREVRGHLKRRKPEECSCSKQSYYNKEKGVKKQEKLKSHLHPFKEAAQEVDSKL 713 

1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 j 1 1 1 1 1 1 i 1 1 1 1 E I E 1 1 1 1 1 1 1 1 ill 

QDKIKNLREVRGHLKRRKPEECSCSKQSYYNKEKGVKKQEKLKSHLHPFKEAAQEVDSKL 66 0 
QLFKENNRRRKKERKEKRRQRKGEECSLPGLTCFTHDNNHWQTAPFWNLGSFCACTSSNN 77 3 

I II II II II II II II II II II III I! II II I! II II II II II II II II II II II II II II 

QLFKENNRRRKKERKEKRRQRKGEECSLPGLTCFTHDNNHWQTAPFWNLGSFCACTSSNN 72 0 
NTYWCLRTVNETHNFLFCEFATGFLEYFDMNTDPYQLTNTVHTVERGILNQLHVQLMELR 83 3 

IIIIMIIIIIIIMIIIIIIMI IIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMI 

NTYWCLRTVNETHNFLFCEFATGFLEYFDMNTDPYQLTNTVHTVERGILNQLHVQLMELR 78 0 
SCQGYKQCNPRPKNLDVGNKDGGSYDLHRGQLWDGWEG 8 71 

Ml MM II II II IMM! Mill MINI MM Ml 

S CQ GY KQ CN PR PKNLDVGN KDGGS YD LH RGQL WD GWEG 818 
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RESULT 1 
SULF1_HUMAN 

ID SULF 1_HUMAN ' STANDARD; PRT ; 871 AA. 

AC Q8IWU6; Q86YV8; Q8NCA2 ; Q9UPS5 ; 

DT 19-JUL-2003, integrated into UniProtKB/Swiss -Prot . 

DT 01-MAR-2003, sequence version 1. 

DT 07-FEB-2006, entry version 26. 

DE Extracellular sulfatase Sulf-1 precursor (EC 3.1.6.-) (HSulf-1) . 

GN Name=SULFl; Synonyms =K I AA1 077 ; 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chorda ta ; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Euarchontoglires ; Primates; Catarrhini; Hominidae; 

OC Homo . 

OX NCBIJTaxID=9 60 6; 

RN [1] 

RP NUCLEOTIDE SEQUENCE [MRNA] , CHARACTERIZATION, .AND MUTAGENESIS OF 

RP 87-CYS-CYS-88. 

RC TISSUE=Prostate; 

RX MEDLINE=22370956 ; PubMed=123 68295 ; DOI= 10 . 1 074/ jbc . M2 05 13 12 00 ; 

RA Morimoto-Tomita M . , Uchimura K., Werb Z. , Hemmerich S., Rosen S.D.; 

RT "Cloning and characterization of two extracellular heparin- degrading 

RT endosulf atases in mice and humans."; 

RL J. Biol. Chem. 277:4 9175-49185(2002). 

RN [2] 

RP NUCLEOTIDE SEQUENCE [MRNA] , AND MUTAGENESIS OF 87 -CYS -CYS - 88 . 

RX MEDLINE=22692523 ; PubMed=l 26 86 563 ; DOI= 10 . 1 07 4/ jbc . M3 02 20 32 00 ; 

RA Lai J., Chien J., Staub J., Avula R., Greene E.L., Matthews T.A., 

RA Smith D.I., Kaufmann S.H., Roberts L.R. # Shridhar V.; 

RT "Loss of HSulf-1 up-regulates heparin-binding growth factor signaling 

RT in cancer. " ; 

RL J. Biol. Chem. 2 78:23107-23117(2003). 

RN [3] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 54-871. 

RC TISSUE=Brain; 

RX MEDLINE=99397452 ; PubMed=l 04 70 851 ; DOI=10 . 1 09 3 /dnares /6 .3 .197 ; 

RA Kikuno R., Nagase T. # Ishikawa K. , Hirosawa M., Miyaj ima N. , 

RA Tanaka A., Kotani H., Nomura N . , Ohara 0.; 

RT "Prediction of the coding sequences of unidentified human genes. XIV. 

RT The complete sequences of 10 0 new cDNA clones from brain which code 

RT for large proteins in vitro."; 

RL DNA Res. 6:197-205(1999). 

RN [4] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 312-871. 

RC TISSUE=Teratocarcinoma ; 

RX PubMed=14702039; DOI =1 0 . 10 38 /ngl2 85 ; 

RA Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R., 

RA Wakamatsu A., Hayashi.K., Sato H . , Nagai K. , Kimura K., Makita H. , 

RA Sekine M., Obayashi M. , Nishi T., Shibahara T., Tanaka T. , Ishii S., 

RA Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K. , 

RA Murakami K., Yasuda T. , Iwayanagi T., Wagatsuma M., Shiratori A., 

RA Sudo H., Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M. , 

RA Takahashi M . , Kanda K. , Yokoi T., Furuya T. , Kikkawa E . , Omura Y. , 

RA Abe K. , Kamihara K., Katsuta N . ; Sato K., Tanikawa M . , Yamazaki M., 

RA Ninomiya K., Ishibashi T., Yamashita H. , Murakawa K., Fujimori K. , 

RA Tanai H. , Kimata M . , watanabe M . , Hiraoka S., Chiba Y., Ishida S., 

RA Ono Y., Takiguchi S., Watanabe S. f Yosida M., Hotuta T. , Kusano J . , 

RA Kanehori K., Takahashi -Fuj ii A., Hara H., Tanase T.-O., Nomura Y. , 

RA Togiya S., Komai F., Hara R. , Takeuchi K. , Arita M. , Imose N. , 

RA Musashino K. , Yuuki H. , Oshima A., Sasaki N . , Aotsuka S, t 

RA Yoshikawa Y., Matsunawa H. , Ichihara T. , Shiohata N., Sano S., 

RA Moriya S., Momiyama H., Satoh N. , Takami S., Terashima Y. , Suzuki O., 
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RA Nakagawa S., Senoh A., Mizoguchi H. , Goto Y., Shimizu F . , Wakebe H. , 

RA Hishigaki H. , Watanabe T . , Sugiyama A., Takemoto M. , Kawakami B., 

RA Yamazaki M . , Watanabe K. , Kumagai A., Itakura S., Fukuzumi Y., 

RA Fujimori Y., Komiyama M . , Tashiro H., Tanigami A., Fujiwara T. ( 

RA Ono T., Yamada K., Fujii Y., Ozaki K. , Hirao M. , Ohmori Y., 

RA Kawabata A., Hikiji T. , Kobatake N. , Inagaki H., Ikema Y. , Okamoto S., 

RA Okitani R. , Kawakami T., Noguchi S., Itoh T., Shigeta K., Senba T., 

RA Matsumura K. , Nakajima Y., Mizuno T., Morinaga M . , Sasaki M . , 

RA Togashi T. , Oyama M. , Hata H., Watanabe M . , Komatsu T., 

RA Mizushima-Sugano J., Satoh T. , Shirai Y., Takahashi Y . , Nakagawa K., 

RA Okumura K. , Nagase T., Nomura N. , Kikuchi H., Masuho Y., Yamashita R., 

RA Nakai K. , Yada T . , Nakamura Y. , Ohara 0., Isogai T. , Sugano S.; 

RT "Complete sequencing and characterization of 21,243 full-length human 

RT cDNAs 

RL Nat. Genet. 36:40-45(2004). 

CC -!- FUNCTION: Exhibits arylsul f atase activity and highly specific 

CC endoglucosamine-6- sul f atase activity. It can remove sulfate from 

CC the C-6 position of glucosamine within specific subregions of 

CC intact heparin. Diminishes HSPG (heparan sulfate proteoglycans) 

CC sulfation, inhibits signaling by heparin-dependent growth factors, 

CC diminishes proliferation, and facilitates apoptosis in response to 

CC exogenous stimulation. 

CC -!- BIOPHYSICOCHEMICAL PROPERTIES: 

CC pH dependence: 

CC Optimum pH is 7.0-8.0; 

CC -!- SUBCELLULAR LOCATION: Endoplasmic reticulum and Golgi stack. Also 

CC localized on the cell surface (By similarity) . 

CC -!- TISSUE SPECIFICITY: Expressed at highest levels in testis, 

CC stomach, skeletal muscle, lung, kidney, pancreas, small intestine 

CC and colon. It is also detected in normal ovarian surface 

CC epithelial cells. Down-regulation seen in ovarian carcinoma cell 

CC lines, ovarian cancers, breast, pancreatic, renal and 

CC hepatocellular carcinoma cell lines. 

CC -!- SIMILARITY: Belongs to the sulfatase family. 

CC -' 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribut ion-NoDerivs License 

CC 

DR EMBL; AY101175; AAM76860.1; -; mRNA. 

DR EMBL; AF54 55 71; AA03 3315.1; -; mRNA. 

DR EMBL; AB029000; BAA83029.1; -; mRNA. 

DR EMBL; AK074873; BAC11258.1; -; mRNA. 

DR Ensembl; ENSG00000137573 ; Homo sapiens. 

DR HGNC; HGNC:2 03 91; SULF1. 

DR GO; GO:0009986; C:cell surface; IDA. 

DR GO; GO:0005615; C : extracel lular space; NAS . 

DR GO; GO:0004065; F : aryl sulfatase activity; IDA. 

DR GO; GO: 0 03 02 01; P: heparan sulfate proteoglycan metabolism; NAS. 

DR InterPro; IPR000917; Sulfatase. 

DR Pfam; PF00884; Sulfatase; 1. 

DR PROSITE; PS00523; SULFATAS E_l ; 1. 

DR PROSITE; PS00149; SULFATAS E_ 2 ; FALSE_NEG. 

KW Apoptosis; Endoplasmic reticulum; Glycoprotein; Golgi stack 

KW Hydrolase; Signal. 

FT SIGNAL 1 22 Potential. 

FT CHAIN 23 871 Extracellular sulfatase Sulf-1.. 

FT /FTId=PRO_0000033434 . 

FT M0D_RES 87 87 3 -oxoalanine (Cys) (By similarity). 

FT CARBOHYD 64 64 N-linked (GlcNAc. . .) (Potential). 

FT CARBOHYD 111 111 N-linked (GlcNAc. . .) (Potential). 

FT CARBOHYD 131 131 N-linked (GlcNAc. . .) (Potential). 
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"FT 1 


CARROHYD 

XvO vn J. i-^ 


14 8 


148 


N- linked (GlcNAc 


. ) 


(Potential ) 


FT 


CARROHYD 


17 0 


170 


N-linked (GlcNAc 


. ) 


(Potential ) . 


FT 


PARR OH YD 


19 7 


197 


N-linked (GlcNAc. . 


. ) 


(Potential ) . 


FT 


CARBOHYD 


24 0 


240 


N-linked (GlcNAc. . 


. ) 


(Potential ) . 


FT 


CARBOHYD 


623 


623 


N-linked (GlcNAc. . 


. ) 


(Potential ) . 


FT 


CARBOHYD 


773 


7 73 


N-linked (GlcNAc. . 


. ) 


(Potential ) . 


FT 


CARBOHYD 


78 3 


7 83 


N-linked (GlcNAc. . 


. ) 


(Potential ) . 


FT 


MUTAGEN 


87 


88 


CC->AA: Loss of arylsulf atase activity 


FT 








and loss of ability 


to 


modulate 


FT 








apoptosis . 






FT 


CONFLICT 


49 


49 


L. -> P (in Ref . 2) . 






FT 


CONFLICT 


728 


728 


K -> R (in Ref . 4) . 






SQ 


SEQUENCE 


8 71 AA; 


101027 


MW; 9A9 0ADB2 8 03 04 36 4 


CRC6 4; 



Query Match 100.0%; Score 4 729; DB 1; Length 871; 

Best Local Similarity 100.0%; Pred. No. 4.3e-274 ; 

Matches 871; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MKYSCCALVLAVLGTELLGSLCSTVRSPRFRGRIQQERKNIRPNIILVLTDDQDVELGSL 60 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 
Db 1 MKYSCCALVLAVLGTELLGSLCSTVRSPRFRGRIQQERKNIRPNIILVLTDDQDVELGSL 60 

Qy 61 QVMNKTRKIMEHGGATFINAFVTT PMCCPSRSSMLTGKYVHNHNVYTNNENCSSPSWQAM 12 0 

I II II II II II II II II II II III II II II II II II II M II II II II II II II II II II 

Db 61 QVMNKTRKIMEHGGATF INAF VTT PMCCPSRS SMLTGKYVHNHNVYTNNENCSS PS WQAM 12 0 

Qy 121 HEPRTFAVYLNNTGYRTAFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKE 18 0 

Ml II MM II II II MM II III II II II II MM II II II II II II II II II 1 1 II II 

Db 121 HEPRTFAVYLNNTGYRTAFFGKYLNEYNGSYI PPGWREWLGLIKNSRFYNYTVCRNGI KE 180 

Qy 181 KHGFDYAKDYFTDLITNESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPN 24 0 

I II II II MM II II II II II III II II II II II II I! II MM II M M II II II II II 

Db 181 KHGFDYAKDYFTDLITNESINYFKMSKRMY PHRPVMMVISHAAPHGPEDSAPQFSKLYPN 24 0 

Qy 2 41 ASQHI TPSYNYAPNMDKHWIMQYTGPMLPI HMEFTNILQRKRLQTLMSVDDS VERLYNML 30 0 

I I I I I M I I I I II I I I I I I II I II II I I I I II I I I I II II I I II II I I II II II I I I I II 
Db 241 ASQHI TPSYNYAPNMDKHWIMQYTGPMLPI HMEFTNILQRKRLQTLMSVDDS VERLYNML 30 0 

Qy 301 VETGELENTYI IYTADHGYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVE PGSI VPQI VL 360 

I I I I I II II I I M II II II I I II I I I I I I I II II I I I I II II I I II II I I II I I II II II 
Db 3 01 VETGELENTYI IYTADHGYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVE PGSI VPQI VL 36 0 

Qy 3 61 NIDLAPTILDIAGLDTPPDVDGKS VLKLLDPEKPGNRFRTNKKAKI WRDTFLVERGKFLR 42 0 

I II II II II II II II II II II III MM INI II II MM II II II II II II INI II II 

Db 3 61 NIDLAPTILDIAGLDTPPDVDGKS VLKLLDPEKPGNRFRTNKKAKI WRDTFLVERGKFLR 42 0 

Qy 4 21 KKE ES SKNI QQ SNHL PK YERVKEL CQQARYQTAC EQ PGQKWQ CI EDTS GKLR IH KC KG PS 48 0 

1 1 1 1 1 1 1 1 1 1 1 M 1 1 II 1 1 1 1 II I II 1 1 1 1 II II 1 1 1 1 1 1 II II II 1 1 1 1 1 1 II II II 1 1 

Db 4 21 KKEESSKNIQQSNHLPKYERVKELCQQARYQTACEQPGQKWQCIEDTSGKLRIHKCKGPS 48 0 

Qy 4 81 DLLTVRQSTRNLYARGFHDKDKECSCRESGYRASRSQRKSQRQFLRNQGTPKYKPRFVHT 54 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 4 81 DLLTVRQSTRNLYARGFHDKDKECSCRESGYRASRSQRKSQRQFLRNQGTPKYKPR FVHT 54 0 

Qy 541 RQTRSLSVEFEGEIYDINLEEEEELQVLQPRNIAKRHD EGHKGPRDLQASSGGNRGRMLA 600 

1 1 1 II II II II II 1 1 II 1 1 1 1 M 1 1 1 II 1 1 1 1 1 1 II II 1 1 II II II II 1 1 II M II 1 1 II 

Db 541 R QTRS LS VE FE GE I Y DI NL EE EEE LQ VL QP RN I AKRHD EGHKGPRDLQASSGGNRGRMLA 600 

Qy 6 01 DSSNAVGPPTTVRVTHKCFILPNDSIHCERELYQSARAWKDHKAYIDKEI EALQDKIKNL 66 0 

I II II II II II II II II II II III II II II II II II II II II II II II II II II II II II 
Db 6 01 DSSNAVGPPTTVRVTHKCFILPNDS I HCERELYQSARAWKDHKAYIDKEI EALQDKIKNL 66 0 
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Qy 


661 


REVRGHLKRRKPEECSCSKQSYYNKEKGVKKQEKLKSHLHPFKEAAQEVDSKLQLFKENN 


720 






i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i 
1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


661 


REVRGHLKRRKPEECSCSKQSYYNKEKGVKKQEKLKSHLHPFKEAAQEVDSKLQLFKENN 


720 


Qy 


721 


RRRKKERKEKRRQRKGEECSLPGLTCFTHDNNHWQTAP FWNLGSFCACTSSNNNTYWCLR 


780 




i i i i i i i i i i i i i i i i i i i i i i i i i i i \ i i i i i t i i i i i i i i i i i i i i i i i i i i i i i i i i 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


721 


RRRKKERKEKRRQRKGEECSLPGLTCFTHDNNHWQTAP FWNLGSFCACTSSNNNTYWCLR 


780 


Qy 


781 


TVNETHNFLFCEFATGFLEYFDMNTDPYQLTNTVHTVERGILNQLHVQLMELRSCQGYKQ 


84 0 






1 MM II MM MM II II II III INI II II 1! MM II II II II INI MM 1! II 1 




Db 


781 


TVNETHNFLFCEFATGFLEYFDMNTDPYQLTNTVHTVERGILNQ LHVQLMELRSCQGYKQ 


84 0 


Qy 


841 


CNPRP KN LD VGNKDGGS YDLH RGQ LWDG WEG 871 








Mill II llllllll II MM III MMMI 




Db 


841 


CNPRP KNLD VGNKDGGS YDLH RGQ LWDG WEG 871 
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Database : Publ ished_Applicat ions_AA_Main : * 

1 : /EMC_Celerra_SIDS3 /ptodata/2 /pubpaa/US07_PUBCOMB .pep : * 

2 : /EMC_Celerra_SIDS3/ptodata/2/pubpaa/US08_PUBCOMB .pep : * 

3 : /EMC_Celerra_SIDS3/ptodata/2/pubpaa/US09_PUBCOMB .pep : * 

4 : /EMC_Celerra_SIDS3/ptodata/2/pubpaa/US10A_PUBCOMB.pep:* 

5 : /EMC_Celerra_SIDS3 /ptodata/2 /pubpaa/US10B_PUBCOMB. pep: * 

6 : /EMC_Celerra_SIDS3 /ptodata/2 /pubpaa/USll_PUBCOMB .pep: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

Result Query 



NO . 


Score 


Match 


Length 


DB 


ID 










Description 


1 


4 72 9 
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4 
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Sequence 


31, Appl 


35 


3179 . 5 


67 


2 


870 


5 


us 


-10 


-933 


-025 


" 6 X 


Sequence 


'6, Appli 


36 


3179. 5 


67 


2 


870 


5 


us 


-10 


-933 


-025 


Sequence 


15, Appl 


37 


3179. 5 


67 


2 


870 


6 


us 


-11 


-219 


-360 


-6 


Sequence 


6, Appli 


38 


3179. 5 


67 


2 


87 0 


6 


us 


-11 


-219 


-360 


-650 
-64 9 * 


Sequence 


15, Appl 


39 


3179. 5 


67 


2 


870 


6 


us 


-11 


-264 


-096 


Sequence 


6 50, App 


40 


3178. 5 


67 


2 


870 


3 


us 


-09 


-833 


-245 


Sequence 


64 9, App 


41 


3178. 5 


67 


2 


870 


4 


us 


-10 


-264 


-237 


-2754 


Sequence 


2754, Ap 


42 


3178. 5 


67 


2 


870 


6 


us 


-11 


-264 


-096 


-649 


Sequence 


64 9, App 


43 


3175.5 


67 


1 


870 


3 


us 


-09 


-970 


-287 


-2 


Sequence 


2 , Appl i 


44 


3175. 5 


67 


1 


870 


6 


us 


-11 


-066 


-098 


-2 


Sequence 


2, Appli 
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Database : Published_Appl icat ions_AA_New : * 

1 : /EMC_Celerra_SIDS3 /ptodata/l/pubpaa/US09_NEW_PUB . pep : * 

2 : /EMC_Celerra_SIDS3 /ptodata/l/pubpaa/US06_NEW_PUB . pep : * 

3 : /EMC_Celerra_SIDS3 /ptodata/l/pubpaa/US07_NEW_PUB . pep : * 

4 : /EMC_Celerra_SIDS3 /ptodata/l/pubpaa/US08_NEW_PUB .pep : * 

5 : /EMC_Celerra_SIDS3 /ptodata/l/pubpaa/PCT_NEW_PUB . pep: * 

6 : /EMC_Celerra_SIDS3 /ptodata/l/pubpaa/US10_NEW_PUB . pep : * 

7 : /EMC_Celerra_SIDS3 /ptodata/l/pubpaa/USll_NEW_PUB .pep : * 

8 : /EMC_Celerra_SIDS3 /ptodata/l/pubpaa/US60_NEW_PUB .pep : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



Result 



SUMMARIES 



•% 

Query 



to. 


Score 


Match 


Length 


DB 


ID 










Description 


1 


4729 


100 


0 


871 


7 


US 


-11 


-313 


-836 


t 

-11 


Sequence 


11, Appl 


2 


4729 


100 


.0 


871 


7 


US 


-11 


-105 


-233 


-140> 
-96^ # 


Sequence 


14 0 , App 


3 


4729 


100 


.0 


871 


7 


us 


-11 


-043 


-842 


Sequence 


96, Appl 


4 


4725 


99 


. 9 


871 


7 


us 


-11 


-043 


-842 


r840^ 


Sequence 


8 4 0, App 


5 


4722 


99 


.9 


871 


6 


us 


-10 


-700 


-439 


-103*\> 


Sequence 


103, App 


6 


4722 


99 


.9 


871 


7 


us 


-11 


-316 


-132 


-1 ^ 


Sequence 


1 , Appl i 


7 


4406.5 


93 


.2 


870 


7 


us 


-11 


-316 


-132 


-2 

-98 ^ 


Sequence 


2 , Appl i 


8 


4237 


89 


.6 


791 


7 


us 


-11 


-043 


-842 


Sequence 


9 8 , App 1 


9 


4105 


86 


.8 


867 


7 


us 


-11 


-316 


-132 


-3 \ 


Sequence 


3 , Appl i 


10 


4102 


86 


.7 


790 


7 


us 


-11 


-043 


-842 


-97 i\ 


Sequence 


97, Appl 


11 


3999 


84 


.6 


751 


7 


us 


-11 


-043 


-842 


-1006 ^ 


Sequence 


1006, Ap 


12 


3179. 5 


67 


2 


870 


7 


us 


-11 


-316 


-132 


-4 V 


Sequence 


4 , Appl i 


13 


2244 


47 


5 


416 


7 . 


us 


-11 


-043 


-842 


-99 


Sequence' 


99, Appl 


14 


2142 


45 


.3 


410 


7 


us 


-11 


-043 


-842 


-100 


Sequence 


100, App 


15 


1025 


21 


7 


210 


7 


us 


-11 


-043 


-842 


-101 


Sequence 


101, App 


16 


73 0 


15 


4 


14 5 


7 


us 


-11 


-043 


-842 


-102 


Sequence 


102, App 


17 


280 


5 


9 


536 


6 


us 


-10 


-527 


-101 


-12 


Sequence 


12, Appl 


18 


238 


5 


0 


40 


7 


us 


-11 


-043 


-842 


-841 


Sequence 


8 41, App 


19 


208 


4 


4 


515 


6 


us 


-10 


-196 


-749 


-76 


Sequence 


7 6, Appl 


20 


207. 5 


4 


4 


599 


7 


us 


-11 


-313 


-836 


-13 


Sequence 
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Database : UniProt__7 . 2 : * 

1: uniprot_sprot : * 
2: uniprot_trembl : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 



Jo. 


Score 


Match 


Length 


DB 


ID 


Description 


1 


4729 


100 . 


. 0 


871 


1 


SULF1_HUMAN 


Q8 iwu6 


homo sapien 


2 


4725 


99 


. 9 


871 


2 


Q7Z2W2_HUMAN 


Q7z2w2 


homo sapien 


3 


4450 


94 


. 1 


869 


2 


Q3 2KH2_CANFA 


Q32kh2 


canis famil 


4 


4406 . 5 


93 


. 2 


870 


1 


SULF1JRAT 


Q8vi6 0 


rattus norv 


5 


4393 . 5 


92 


. 9 


870 


1 


SULF1_M0USE 


Q8k007 


mus musculu 


6 


4159. 5 


88 


. 0 


• 1079 


2 


Q6ZPZ0_MOUSE 


Q6zpz 0 


mus musculu 


7 


4105 


86 


.8 


867 


1 


SULFl_COTCO 


Q90xb6 


coturnix co 


8 


3478 


73 


. 5 


874 


2 


Q6EFA0_BRARE 


Q6ef a0 


brachydanio 


9 


3473 . 5 


73 


. 5 


892 


2 


Q6 EF9 9_BRARE 


Q6ef 9 9 


brachydanio 


10 


3413 . 5 


72 


. 2 


1099 


2 


Q6 EFA 1_BRARE 


Q6e fa 1 


brachydanio 


11 


3330 


70 


. 4 


644 


2 


Q6NUJ 9_HUMAN 


Q6nuj 9 


homo sapien 


12 


3210 


67 


.9 


877 


2 


Q5UAX9_COTCO 


Q5uax9 


coturnix co 


13 


3195 


67 


. 6 


1160 


2 


Q4SFB 1_TETNG 


Q4sfbl 


tetraodon n 


14 


3179. 5 


67 


.2 


870 


1 


SULF2_HUMAN 


Q8iwu5 


homo sapien 


15 


3179. 5 


67 


.2 


870 


2 


Q5JYE 1_HUMAN 


Q5jyel 


homo sapien 


16 


3165 


66 


.9 


869 


2 


Q3 2KH1_CANFA 


Q32khl 


canis famil 


17 


314 9 


66 


.6 


875 


1 


SULF2_MOUSE 


Q8cfg0 


mus musculu 


18 


314 9 


66 


.6 


94 8 


2 


Q6ZPV3_MOUSE 


Q6zpv3 


mus musculu 


19 


314 6 


66 


.5 


875 


2 


Q3TNM3JVIOUSE 


Q3tnm3 


m 12 days e 


20 


3144 


66 


.5 


875 


2 


Q3L4 7 2_RAT 


Q31472 


rattus norv 


21 


3136 


66 


.3 


875 


2 


Q6GL2 9_XENTR 


Q6gl29 


xenopus tro 


22 


3107 


65 


.7 


866 


2 


Q3 2KJ3_RAT 


Q32kj3 


rattus norv 


23 


3086 


65 


.3 


867 


2 


Q6EF9 7_BRARE 


Q6ef97 


brachydanio 


24 


3067 


64 


. 9 


885 


2 


Q6EF9 8_BRARE 


Q6ef 98 


brachydanio 


25 


2777 


58 


.7 


873 


2 


Q7ZVU8_BRARE 


Q7zvu8 


brachydanio 


26 


2721. 5 


57 


.5 


879 


2 


Q4SZ4 1_TETNG 


Q4SZ41 


tetraodon n 


27 


2701 


57 


. 1 


1239 


2 


Q4 SR7 7_TETNG 


Q4sr77 


tetraodon 
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Database 



PIR_80 : * 
pirl: ' 
pir2 : ' 
pir3 : ' 
pir4 : 1 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result 
No. 



Score 



Query 

Match Length DB 



ID 



Description 



1 


1607 


34 


0 


709 


2 


T16584 


hypothetical prote 


2 


829. 5 


17 


.5 


552 


1 


KJHUGU 


N-ace tylglucosamin 


3 


468 . 5 


9 


.9 


64 9 


2 


S43229 


arylsulfatase (EC 


4 


440 


9 


.3 


63 9 


2 


T46577 


arylsulf atase (EC 


5 


355 


7 


5 


646 


1 


KJKM 


arylsulfatase (EC 


6 


309 


6 


5 


497 


2 


G65169 


probable sulfatase 


7 


307 


6 


5 


497 


2 


C91206 


probable sulfatase 


8 


307 


6 


.5 


497 


2 


E86052 


probable sulfatase 


9 


276. 5 


5 


.8 


571 


2 


E64903 


arylsulfatase homo 


10 


276. 5 


5 


.8 


571 


2 


G90891 


probable sulfatase 


11 


275. 5 


5 


.8 


535 


2 


AF0103 


probable sulfatase 


12 


275 . 5 


5 


.8 


571 


2 


B85726 


probable sulfatase 


13 


266 


5 


.6 


557 


2 


AF0370 


probable sulfatase 


14 


254 . 5 


5 


.4 


55 0 


1 


KJHUID 


iduronate-2 -sul fat 


15 


24 0 


5 


.1 


563 


2 


A4 715 3 


iduronate-2 -sul fat 


16 


239. 5 


5 


.1 


517 


2 


AB0102 


probable sulfatase 


17 


238. 5 


5 


.0 


583 


1 


KJHUAC 


steryl- 
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Database : issued_Patents_AA: * 

1 : /EMC_Celerra_SIDS3/ptodata/2/iaa/5_COMB.pep: * 

2 : /EMC_Celerra_SIDS3/ptodata/2/iaa/6_COMB.pep: * 

3 : /EMC_Celerra_SIDS3/ptodata/2/iaa/7_COMB.pep: * 

4 : /EMC_Celerra_SIDS3/ptodata/2/iaa/H_COMB.pep: * 

5 : /EMC_Celerra_SIDS3 /ptodata/2 /iaa/PCTUS_COMB . pep : * 

6 : /EMC_Celerra_SIDS3 /ptodata/2 /iaa/RE_COMB .pep : * 

7 : /EMC_Celerra_SIDS3 /ptodata/2 /iaa/backf ilesl . pep: * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


4729 


100 


. 0 


871 


2 


US 


-09 


-773 


-426A-3 


Sequence 


3, Appli 


2 


4729 


100 


. 0 


871 


2 


US 


-10 


-314 


-881 -3 


Sequence 


3 , Appl i 


3 


4729 


100 


. 0 


871 


2 


US 


-09 


-495 


-823 -3 


Sequence 


3 , Appli 


4 


4729 


100 


. 0 


871 


3 


US 


-10 


-426 


-776 -11 


Sequence 


1 1 , App 1 


5 


4460 


94 


. 3 


1611 


2 


us 


-09 


-668 


-673B-16 


Sequence 


16, Appl 


6 


4460 


94 


. 3 


1611 


2 


us 


-10 


-389 


-532 -16 


Sequence 


1 6 , App 1 


7 


4105 


86 


. 8 


867 


2 


us 


-09 


-668 


-673 B- 2 


Sequence 


2 , App 1 i 


8 


4105 


86 


.8 


867 


2 


us 


-10 


-389 


-532 -2 


Sequence 


2 , Appl i 


9 


3160 


66 


. 8 


867 


2 


us 


-09 


-991 


-181 -84 


Sequence 


8 4 , App 1 


10 


3160 


66 


.8 


867 


2 


us 


-09 


-990 


-444 -84 


Sequence 


8 4 , App 1 


11 


3160 


66 


. 8 


867 


2 


us 


-09 


-997 


-333 -84 


Sequence 


8 4 , App 1 


12 


3160 


66 


8 


867 


2 


us 


-09 


-992 


-598 -84 


Sequence 


8 4 , App 1 


13 


3160 


66 


. 8 


867 


2 


us 


-09 


-989 


-735 -84 


Sequence 


84, Appl 


14 


3160 


66 


8 


867 


3 


us 


-09 


-989 


-726 -84 


Sequence 


84, Appl 


15 


3160 


66 


.8 


867 


3 


us 


-09 


-997 


-514 -84 


Sequence 


8 4 , App 1 


16 


3160 


66 


8 


867 


3 


us 


-09 


-989 


-728 -84 


Sequence 


84, Appl 


17 


3160 


66 


.8 


867 


3 


us 


-09 


-997 


-349-84 


Sequence 


84, Appl 


18 


3160 


66 


.8 


867 


3 


us 


-09 


-997 


-653 -84 


Sequence 


84, Appl 


19 


3160 


66 


8 


867 


3 


us 


-09 


-989 


-293A-84 


Sequence 


84 , Appl 


20 


2418 


51 


. 1 


690 


2 


us 


-09 


-886 


-319A-36 


Sequence 


36, Appl 


21 


1950 


41 


.2 


360 


2 


us 


-09 


-810 


-347 -4 


Sequence 


4 , Appl i 


22 


1927. 5 


40 


.8 


372 


2 


us 


-09 


-810 


-347 -2 


Sequence 


2 , Appl i 


23 


1681 


35 


.5. 


307 


2 


us 


-09 


-810 


-347-5 ' 


Sequence 


5 , Appl i 


24 


1607 


34 


.0 


709 


2 


us 


-09 


-668 


-673B-3 


Sequence 


3 , App 1 i 


25 


1607 


34 


. 0 


709 


2 


us 


-10 


-389 


-532 -3 


Sequence 


3 , Appl i 


26 


1245 


26 


.3 


455 


2 


us 


-09 


-668 


-673B-14 


Sequence 


14, Appl 


27 


1245 


26 


.3 


455 


2 


us 


-10 


-389 


-532 -14 


Sequence 


14, Appl 


28 


1172 


24 


.8 


309 


2 


us 


-09 


-810 


-347-6 


Sequence 


6 , Appl i 


29 


1021 


21 


.6 


196 


2 


us 


-09 


-668 


-673B-18 


Sequence 


18, Appl 


30 


1021 


21 


.6 


196 


2 


us 


-10 


-389 


-532 -18 


Sequence 


18, Appl 


31 


956. 5 


20 


.2 


470 


2 


us 


-09 


-886 


-319A-35 


Sequence 


35, Appl 


32 


807 


17 


1 


160 


2 


us 


-09 


-668 


-673B-7 


Sequence 


7, Appli 


33 


807 


17 


.1 


160 


2 


us 


-10 


-389 


-532 -7 


Sequence 


7 , Appl i 


34 


801. 5 


16 


9 


510 


1 


us 


-08 


-484 


-493 -11 


Sequence 


11, Appl 


35 


801. 5 


16 


9 


510 


1 


us 


-08 


-484 


-494 -11 


Sequence 


11, Appl 


36 


801. 5 


16 


9 


510 


1 


us 


-08 


-345 


-212 -11 


Sequence 


11, Appl 


37 


801. 5 


16 


9 


510 


2 


us 


-09 


-249 


-003 -11 


Sequence 


11, Appl 


38 


801. 5 


16 


9 


510 


2 


us 


-09 


-685 


-844 -11 


Sequence 


11, Appl' 


39 


506 


10 


7 


96 


2 


us 


-09 


-668 


-673B-20 


Sequence 


20, Appl 


40 


506 


10 


7 


96 


2 


us 


-10 


-389 


-532 -20 


Sequence 


20, Appl 


41 


406.5 


8 


6 


387 


2 


us 


-09 


-270 


-767-33219 


Sequence 


33219, A 


42 


406.5 


8 


6 


387 


2 


us 


-09 


-270 


-767 -48436 


Sequence 


48436, A 


43 


381.5 


8 


1 


520 


2 


us 


-09 


-773 


-426A-10 


Sequence 


10, Appl 
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44 381.5 8.1 520 2 US-10 -314 -881 -10 Sequence 10, Appl 

45 334.5 7.1 552 2 US - 09 - 773 -426 A- 9 Sequence 9, Appli 
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Database : A_Geneseq_8 : * 

1: geneseqpl980s : * 

2: geneseqpl990s : * 

3: geneseqp2 00 0s : * 

4 : geneseqp2 001s : * 

5 : geneseqp2002s : * 

6: geneseqp2 003as : * 

7 : genes eqp2 00 3bs : * 

8: geneseqp2004s : * 

9: geneseqp2005s : * 

10: geneseqp2006s : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


4729 


100 


. 0 


871 


3 


AAB00 191 


Aab00191 


Breast ca 


2 


4729 


100 


. 0 


871 


4 


AAB85481 


Aab85481 


Human 


235 


3 


4729 


100 


. 0 


871 


5 


ABG.69 64 0 


Abg69640 


Human 


sec 


4 


4729 


100 


. 0 


871 


5 


ABG7183 5 


Abg71835 


Human 


SUL 


5 


4729 


100 


. 0 


871 


6 


ABR47 506 


Abr47506 


Breast ca 


6 


4729 


100 


. 0 


871 


6 


ABU63 345 


Abu63345 


Human 


sul 


7 


4729 


100 


. 0 


871 


7 


ADH60 968 


Adh60968 


Human 


235 


8 


4 72 9 


100 


. 0 


871 


8 


ADH58 92 6 


Adh58926 


Glucosami 


9 


4 72 9 


100 


. 0 


871 


8 


ADJ57 910 


Adj 57910 


Human 


235 


10 


4 72 9 


100 


. 0 


871 


8 


ADN16 18 9 


Adnl6189 


Human 


sul 


11 


4729 


100 


. 0 


871 


8 


ADQ29 65 0 


Adq29650 


Human 


col 


12 


4 729 


100 


. 0 


871 


8 


ADR21 25 0 


Adr21250 


Human 


ext 


13 


4729 


100 


. 0 


871 


8 


ADU4 7 47 2 


Adu47472 


Human 


235 


14 


4729 


100 


. 0 


871 


9 


AEA54 851 


Aea54851 


Human 


turn 


15 


4729 


100 


. 0 


871 


9 


AEC03 934 


Aec03934 


Human 


bre 


16 


4729 


100 


. 0 


871 


10 


AEF69962 


Aef6 9962 Colore ct 


17 


4725 


99 


.9 


871 


9 


AEC04654 


Aec04654 


Human 


bre 


18 


4722 


99 


.9 


871 


9 


ADY25735 


Ady25 73 5 


SULF re la 


19 


4722 


99 


.9 


871 


9 


ADZ09831 


Adz09831 


Human 


bre 


20 


4722 


99 


.9 


871 


9 


AEA04458 


Aea0445 8 


Human 


pro 


21 


4722 


99 


. 9 


871 


9 


ADZ87322 


Adz87322 


Human 


sul 


22 


4722 


99 


. 9 


871 


9 


AEE39581 


Aee39581 


Human 


sul 


23 


4712 


99 


.6 


871 


9 


AEE39599 


Aee39599 


Human 


sul 


24 


4608 


97 


.4 


871 


8 


ADN16208 


Adnl6208 


Mouse 


sul 


25 


4474 . 5 


94 


.6 


828 


8 


ABM85153 


Abm85153 


Human 


dia 


26 


4460 


94 


.3 


818 


5 


AAU69417 


Aau69417 


Lung smal 


27 


4460 


94 


.3 


818 


8 


ADQ17726 


Adql7 72 6 


Human 


sof 


28 


4460 


94 


.3 


1611 


4 


AAE00438 
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5 ABG63 903 
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